On Continuous Optimization Methods in Data Mining — Cluster Analysis, Classification and Regression — Provided for Decision Support and Other Applications

نویسندگان

  • Tatiana Tchemisova
  • Başak Akteke-Öztürk
  • Gerhard Wilhelm Weber
چکیده

Optimization is an area of mathematics that finds an optimum (minimum or maximum) of some function defined in a finite or infinite set. If the functions used for the problem formulation are continuous or piecewise continuous, we obtain a continuous optimization problem. When some practical task is formulated in the form of optimization problem, the success in its solution depends mainly on the quality of a mathematical model and on the choice of an appropriate method of the model solution. Data mining today can be considered as an interdisciplinary research which employs applied mathematics and computational statistics. It treats data obtained in experiments, records, measurements, questionnaires, etc., and aims at modeling, prediction and decision support. In the paper we consider continuous optimization methods in solution of some specific problems arising in data mining: clustering, classification and regression. The results discussed are recently obtained by the members of EURO Working Group on Continuous Optimization (EUROPT) and are based on the theory and methods of continuous optimization.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Integrated DEA and Data Mining Approach for Performance Assessment

This paper presents a data envelopment analysis (DEA) model combined with Bootstrapping to assess performance of one of the Data mining Algorithms. We applied a two-step process for performance productivity analysis of insurance branches within a case study. First, using a DEA model, the study analyzes the productivity of eighteen decision-making units (DMUs). Using a Malmquist index, DEA deter...

متن کامل

Using data mining techniques for predicting the survival rate of breast cancer patients: a review article

    This review was conducted between December 2018 and March 2019 at Isfahan University of Medical Sciences. A review of various studies revealed what data mining techniques to predict the probability of survival, what risk factors for these predictions, what criteria for evaluating data mining techniques, and finally what data sources for it have been used to predict the surv...

متن کامل

A New Algorithm for Optimization of Fuzzy Decision Tree in Data Mining

Decision-tree algorithms provide one of the most popular methodologies for symbolic knowledge acquisition. The resulting knowledge, a symbolic decision tree along with a simple inference mechanism, has been praised for comprehensibility. The most comprehensible decision trees have been designed for perfect symbolic data. Classical crisp decision trees (DT) are widely applied to classification t...

متن کامل

S3PSO: Students’ Performance Prediction Based on Particle Swarm Optimization

Nowadays, new methods are required to take advantage of the rich and extensive gold mine of data given the vast content of data particularly created by educational systems. Data mining algorithms have been used in educational systems especially e-learning systems due to the broad usage of these systems. Providing a model to predict final student results in educational course is a reason for usi...

متن کامل

The application of data mining techniques in manipulated financial statement classification: The case of turkey

Predicting financially false statements to detect frauds in companies has an increasing trend in recent studies. The manipulations in financial statements can be discovered by auditors when related financial records and indicators are analyzed in depth together with the experience of auditors in order to create knowledge to develop a decision support system to classify firms. Auditors may annot...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008